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Abstract 

Alpha-satellite DNA (AS) is a main DNA component of primate centromeres, consisting of tandemly 
repeated units of 170 bp. The AS of humans contains sequences organized into higher-order repeat 
(HOR) structures, in which a block of multiple repeat units forms a larger repeat unit and the larger units 
are repeated tandemly. The presence of HOR in AS is widely thought to be unique to hominids (family 
Hominidae; humans and great apes). Recently, we have identified an HOR-containing AS in the siamang, 
which is a small ape species belonging to the genus Symphalangus in the family Hylobatidae. This result sup- 
ports the view that HOR in AS is an attribute of hominoids(superfamilyHominoidea) rather than hominids. A 
single example is, however, not sufficient for discussion of the evolutionary origin of HOR-containing AS. In 
the present study, we developed an efficient method for detecting signs of large-scale HOR and demonstrated 
HOR of AS in all the three other genera. Thus, AS organized into HOR occurs widely in hominoids. Our results 
indicate that (i) HOR-containing AS was present in the last common ancestor of hominoids or (ii) HOR-con- 
taining AS emerged independently in most or all basal branchesof hominoids. We have also confirmed HOR 
occurrence in centromeric AS in the Hylobatidae family, which remained unclear in our previous study 
because of the existenceof AS in subtelomeric regions, in addition to centromeres, of siamang chromosomes. 
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1 . Introduction 

The centromere is the part of the chromosome to 
which spindle fibres attach via the kinetochore, and it 
serves as the forefront of chromosome migration 
during celldivision. The main DNA components of cen- 
tromeres are usually tandemly repeated sequences, 
and the most abundant sequence in primate centro- 
meres is alpha-satellite DNA (AS), which has basic 
repeat units of ~1 70 bp in length.''^ Human AS has 
been classified into two types according to the organ- 
ization and nucleotide sequence properties of its 
repeat units.^'"^ One is monomeric AS, which is a 



simple repetition of the basic repeat units. The other 
is higher-order AS, whose sequence is organized into a 
higher-order repeat (HOR) structure, in which a block 
of multiple basic repeat units forms a larger repeat 
unit. In a typical HOR structure, any combination of 
two basic repeat units within the larger repeat unit 
shows nucleotide sequence identities of 70-90%, 
whereas the identity often exceeds 95% between a 
basic repeat unit and its counterpart (located at the 
same position) in another larger unit. Higher-order AS 
is known to be present in all normal human centro- 
meres,"^ to occupy central portions of centromeres 
(whereas monomeric AS is found mostly in peripheral 
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regions),^ and to be more important than monomeric 
AS in centromere function. ^■'^'^ It is unclear, however, 
what aspects of higher-order AS are associated with 
these facts. One approach to identifying these associa- 
tions would be to clarify the history of higher-order 
AS, including its evolutionary origin and expansion. 
The present study addresses the evolutionary origin of 
higher-order AS. 

Humans (genus Homo) are included in hominids 
(family Hominidae), along with chimpanzees (genus 
Pan), gorillas (genus Gorilla), and other great apes. 
Small apes (gibbons) are not included in hominids and 
form another taxon (family Hylobatidae). Collectively, 
organisms included in these two families are called 
hominoids (superfamily Hominoidea). All hominids 
examined to date carry higher-order AS.^'^ In small 
apes, however, AS of this type had not been found until 
we recently reported one in the siamang (Sympha- 
langus syndactylus).'^ 

Although the HOR structure observed in siamang AS 
was clear (HOR units consisting of four or six basic 
repeat units), there remained two unresolved questions: 
(i) whether HOR structures are common among 
small apes or are limited to the siamang and (ii) 
whether AS residing in the centromere (without migrat- 
ing to another chromosomal position) exhibits HOR 
structures or if HOR is limited to non-centromeric AS in 
small apes. The second question arose because of the 
fact that the siamang carries AS in terminal regions of 
chromosomes as large blocks of constitutive heterochro- 
matin as well ascentromeres.' °Thefunction and forma- 
tion processes of these heterochromatin blocks are not 
yet clear. Similar structures have also been observed in 
chromosomes of chimpanzees and gorillas. However, 
their DNA component is not AS, but is an unrelated 
tandem repeat sequence (called the StSat sequence) 
comprising 32-bp repeat units.' ^'^^ In the present 
study, we obtained results that can be used to answer 
the two questions described above. Our results indicate 
that higher-orderAS widely occurs in hominoids. 

Str uctu res of ta nde m re peat seq ue nces a re d iff icu It to 
accurately determine using data from whole-genome 
shotgun sequencing or next-generation sequencing, 
because of difficulty in assembling sequence reads into 
contigs. Primer walking on a cloned DNA fragment is 
also inadequate because of the presence of multiple 
primer annealing sites in the clone. In fact, centromere 
regions are left as large gaps even in the human 
genome database.' ^''"^ To overcome this problem, we 
used a traditional method consisting of cloning long 
genomic DNA fragments, preparation of deletion 
clones, and sequencing end regions of these clones. 
There was, however, another problem to be solved 
before sequencing analyses: which clones should be 
sequenced? Our idea was to detect signs of HOR struc- 
tures by comparing sequences of two different regions 



within a clone. For this purpose, we used the transpos- 
ition reaction of a bacterial transposon, which provided 
AS sequences with uniquesitesforannealing by sequen- 
cing primers. Below, we describe our methods, results, 
and their significance with regard to the evolutionary 
origin of higher-orderAS. 



2. Materials and methods 

2.1 . Animals for collection of genomic DNA 

In the latest taxonomy of small apes, the family 
Hylobatidae consists of four genera: Hoolock, Hylobates, 
Nomascus, and Symphalangus.^^'^^ A species of the 
genusSymphalangus (S. syndactylus) has been examined 
in a previous study.' ° In the present study, we focused on 
the other three genera. The species and the origin of the 
individuals that served as DNA sources were as follows: 
the Western hoolock gibbon (Hoolock hoolock), an 
adult female (named Faisal) bred at Bangabandhu 
Sheikh Mujib Safari Park, Bangladesh; the agile gibbon 
(Hylobates agilis), an adult male (named Raja) bred at 
the Primate Research Institute of Kyoto University; and 
the Southern white-cheeked gibbon (Nomascus siki), 
an adult female (identification number 001064376) 
bred at Khao Kheow Forest and Wildlife Park, Thailand. 

2.2. Preparation of genomic libraries 

We collected genomic DNA from white blood cells by 
following a standard method. A genomic library of each 
species was constructed, following a previously described 
protocol.' °'' ^Thevectorwasan 8.1 -kbfosmid pCCl FOS, 
and the inserts were 40- to 44-kb genomic DNA frag- 
ments that were generated by mechanical shearing 
and isolated by gel electrophoresis and subsequent re- 
covery from a gel piece. Identification of AS-containing 
sequences from these libraries is described in the 
Results section together with strategies and results. 

2.3. DNA sequencing 

We performed subcloning of a part of the fosmid 
clones, preparation of a series of deletion clones from 
the subclone plasmids and sequenced the deletion 
clones by using the Ml 3 universal primer. The details 
of our strategy are described in the Results section 
along with the results. The sequencing method used 
was Sanger's method, which involved using an Applied 
Biosystems 3730x/ DNA Analyzer. 

2.4. Fluorescent i n situ hybridization analysis 
of chromosomes 

We performed f I uorescent/tt situ hybridization (FISH) 
analysis following the procedures described previous- 
ly.' ° Specific conditions are explained in each case. 
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3. Results 

3.1 . Collection of genomic DNA clones containing 
repetitive sequences 

We screened the genomic libraries of the three 
gibbon species for repetitive sequences by the genomic 
hybridization method described previously' In this 
screening, the target of the hybridization was fosmid 
clones randomly selected from the library and fixed on 
nylon membranes. The probe was genomic DNA (frag- 
mented by mechanical shearing in advance) of the 
same species as that of the target DNA. For example, 
fragmented genomic DNAof/-/oo/oc/<:/joo/oc/^ was chem- 
ically labelled and used as probe in the hybridization 
assay for Hoolock hoolock fosmid clones. If a fosmid 
carries repetitive DNA (such as AS) as its insert, the 
same sequence occupies a large fraction of the probe 
DNA. A large number of labelled DNA fragments anneal 
to this fosmid, and the fosmid produces an intense 
signal. If a fosmid clone carries a single-copy region, the 
same sequence is present at a low frequency in the la- 
belled DNA, and the fosmid displays a weak (or even un- 
detectable) signal. A big advantage of this method is that 
any repetitive sequences are identified irrespective of 
their nucleotide sequences (even if the sequences are 
unknown), with more chances of identification (stronger 
signals and larger numbers of signals) for sequences of 
higher copy numbers.' ^ For each species, we screened 
384 fosmid clones (four membranes corresponding to 
four 96-well plates) and picked up 1 2 clones exhibiting 
the highest level of signal intensities. 

3.2. Selection ofAS-containing clones 

We sequenced one end of the clones obtained by 
genomic hybridization and selected clones whose se- 
quence reads showed tandem repeat structures with 
unit sizes between 1 00 and 200 bp (the size of the 
basic repeat units of AS is uniform among hominoids at 
~170bp and is included in this range). Of the 12 
Hoolock, 1 2 Hylobates, and 1 2 Nomascus clones, 4, 9, 
and 6 clones, respectively, met this criterion. Sequence 
reads from all of these clones exhibited >70% nucleo- 
tide identity with the consensus sequence of human AS 
and with thatof siamangAS.'° 

3.3. Detection of signs of HOR 

We first tried to detect HOR signs by comparing the 
sequences of two terminal regions (~900 bp each), 
which we had successfully carried out in the case of 
Symphalangus in our previous study.^ This test resulted 
in an HOR sign in one (HylFosI 5) of the clones from 
Hylobates (data not shown). The test, however, did not 
give a positive result in any of the four clones from 
Hoolock or six clones from Nomascus, suggesting that, 
even if these clones contain an HOR structure, its scale 



(the number of basic repeat units constituting an HOR 
unit) is larger than that found in Symphalangus. We 
developed another method to obtain longer sequences, 
with the expectation that longer sequences would in- 
crease the possibility of detecting signs. The strategy 
used for this purpose is shown in Fig. 1 . We prepared a 
clone of the Tn5 transposon that carried the kanamy- 
cin-resistance gene. We mixed it with an AS-containing 
genomic DNAcloneand induced the transposition reac- 
tion by supplying the Tn5 transposase enzyme. The 
fosmid vector (pCCl FOS) for the genomic DNA clone 
carried the chloramphenicol-resistance gene. After the 
transposition reaction, the reaction mixture was intro- 
duced into bacterial cells by electroporation, and single 
colonies that survived on LB/agar plates containing 
chloramphenicol (5 |JLg/ml) and kanamycin (5 |jLg/ml) 
were selected. Fosmid DNA extracted from these col- 
onies mostly carried a single Tn5 insertion at different 
positions. We sequenced two of these clones by using se- 
quencing primers that annealed to the left or right ter- 
minal regions of Tn5 and were oriented outward. 

Each sequence read contained data for >855 (5 x 
171) nucleotides. Although the tail region of the 
sequence read (distant by >700 nucleotides from 
the sequencing primer) had quite a high possibility of 

^Transposition 
Plasmid 



AS-conlaining clone 




Figure 1 . Strategy for the detection of HOR signs. Transposition of tfie 
Tn5 transposon to DNA molecules of an AS-containing fosmid 
clone was induced, and two clones carrying Tn5 at different 
positions were selected. Each clone was sequenced using two 
primers that annealed to Tn5 and oriented outward. The 
sequence reads (~900 bp, including the tail region of a low 
reliability) were combined, and a longer sequence (~1 800 bp) 
was obtained. These sequences were compared between the two 
clones by the dot-matrix analysis. When a line spanning more 
than two basic units was found, it was regarded as a sign of HOR. 
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containing sequencing errors, this was not a problem 
for our purpose because the data were used for detect- 
ing signs of HOR, but sequencing errors masked such 
signs. By combining the sequence reads for both sides 
ofasingleTnS insertion, we could obtain nucleotide se- 
quence information for a region of>1710(10 x 171) 
nucleotides, and it contained sequence information for 
at least nine consecutive repeat units. 

In each clone, we com pa red two sets of such sequences 
bydot-matrix analysis. When a line spanning more than 
two repeat units was found in the dot matrix, we 
regarded it as a sign of an HOR. The detection efficiency 
of this method depends on the scale of the HOR to be 
revealed, namely, how many (designated as n hereafter) 
basic repeat units constitute the HOR. For HORs with n < 
1 6, our method is expected to be powerful enough to 
detect a sign with only two sets of sequence data. For n 
> 1 7, the probability of detecting a sign with two sets 
of sequence data is 1 S/n. The value of the probability is 
as large as 0.5 for n = 30 and 0.375 for n = 40. If we 
use sequence data of additional insertion points, we 
can increase the probability. 



We examined the AS-containingfosmid clones by the 
transposition assay and subsequent sequencing analysis. 
Of the four Hoolock and six Nomascus clones, one and 
three clones, respectively, exhibited signs of HOR. 
Figure 2 shows examples of the results of this dot- 
matrix analysis. We selected one clone each (clones 
HooFos09 of Hoolock and NomFosl B3 of Nomascus) 
and moved on to determining the long sequences of 
the AS arrays contained in these two clones, and also 
the clone HylFosl 5 from Hylobates. 



3.4. Transfer of fragments to plasm id 

As explained earlier, sequencing strategies relying on 
shotgun segmentation or primer walking are not ad- 
equate for tandemly repeated sequences, such as AS. 
The strategy we employed was preparation of a series 
of deletion clones by using exonuclease III and Mung 
bean nuclease.'^ By digesting the DNA fragment with 
an appropriate combination of restriction endonu- 
cleases {l<pn\ whose 3'-protrudingcuttingsite is resistant 
to the exonuclease, and BamH\ or Sail whose 5'- 
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Figure 2. Dot-matrix analysis of clone partial sequences for signs of HOR. The criterion in the dot-matrix analysis was that a 1 9-nucleotide 
match should exist over a window of 20 nucleotides. Examples of the results are shown. HooFos09 and NomFosl B3 exhibited a line 
spanning more than two basic repeat units, and HooFosOS and NomFos3F5 did not. 
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protruding cutting sites are sensitive), the deletion starts 
at one end of the fragment and continues to various 
degrees. The maximum fragment size for which these 
treatments work smoothly is ~1 6 kb. The size of the 
fosmid clones was 40-44 kb. Therefore, it was neces- 
sary to transfer part of the fosmid clones into a plasmid 
vector. For delimiting the part to be analysed, PCR was 
not adequate because of the repetitive nature of the 
object sequence. 

Restriction endonuclease Not\ recognizes an eight- 
nucleotide segment (GCGGCCGC), which rarely 
appears even in the insert size of the fosmid vector. 
The vector carried two Not\ recognition sites located 
closely to its cloning site, and we manipulated the Tn5 
transposon in advance to have an Not\ site in it. These 
Not\ sites enabled us to determine the position of the 
Tn5 insertion by cutting Tn5-carrying fosmid DMAs 
and running an electrophoresis gel. We checked 1 2 
Tn5-carrying clones originating from the HooFos09, 
HylFosI 5, and NomFoslBS clones, and selected one 
in which Tn5 had been inserted at a position 10- 
1 6 kbdistantfromoneendof its ASarray. We then sub- 
cloned the Notl-delimited DNA fragment into the Not\ 
site of an ampicillin-resistant plasmid (pBluescript II 
SK+), and subsequently transferred the fragment 
to a chloramphenicol-resistant plasmid (originating 
from pHSG399) to which we had, in advance, added 
some restriction enzyme sites necessary for the pre- 
paration of deletion clones. We used the ampicillin- 
resistant plasmid because transfer of a long fragment 
from a chloramphenicol-resistant vector to another 
chloramphenicol-resistant vector becomes easier by 
using an intermediate vector carrying a resistance gene 
to a different antibiotic (ampicillin in this case). The 
plasmid clones carrying the AS fragments from the 
HooFos09,HylFos1 5,andNomFosl B3 clonesaredesig- 
nated HooCam09, HylCamlS, and NomCamlBS, 
respectively. 



3.5. Sequencing analysis 

We prepared deletion clones of various lengths by 
treating HooCam09, HylCamlS, and NomCamlBS 
with exonuclease III and then with Mung bean nuclease. 
We checked the insert size of these deletion clones, 
arranged them in the order of longest to shortest, and 
selected 55, 28, and 41 clones, respectively, to be 
used as sources of partial sequences. The criterion for 
this selection was any two adjacent clones that did 
not differ in insert size by >0.5 kb. Figure 3 shows an 
example of gel electrophoresis in which clones of 5.3- 
0.5 kb were placed according to their sizes. We 
sequencedthe selected clones byusingtheMl 3 univer- 
sal primer. We then manually assembled the obtained 
sequence reads into a contig sequence. 




Figure 3. Confirmation of deletion clone insert size. Deletion clones 
of various sizes were prepared from the HooCam09 and 
NomCamlBS clones. Out of these pools, clones of sizes 
adequate for sequencing analysis were selected. This photograph 
is an example of gel electrophoresis in which deletion clones of 
insert sizes of 5.3-0.5 kb, originating from HooCam09, are 
contained. The 2.1 -kb vector plasmid carried two cutting sites 
for restriction endonuclease Xho\ just outside of the cloning site, 
and the insert fragments in the original clones did not have a site 
for this enzyme. Each deletion clone was digested with Xho\ and 
divided into two fragments: the 2.1 -kb plasmid vector and its 
insert fragment. 

3.6. Evidence for HORs 

We obtained a 1 6 429-bp contig sequence for the 
HooCam09 clone, a 7828-bp contig sequence for the 
HylCaml 5 clone, and a 1 2 093-bp contig sequence 
for the NomCaml B3 clone. These were deposited in 
DDBJ with accession numbers AB861 929, AB9001 1 7, 
and AB846956, respectively. We defined the boundar- 
ies and orientation of the basic repeat unit, so that they 
would be the same as those used in our previous 
studies.^ The three clones contained 9 5, 44, and 
70 basic repeat units, respectively. 

We made pairwise comparisons between basic repeat 
units, and the essence of the obtained results is shown as 
matrices in Fig. 4. In the matrices, red and yellow cells in- 
dicate identities of >95 and 90-95%, respectively. 
Matrices containing the nucleotide identity of each 
cell are shown in Supplementary Figs SI -S3. In 
HooCam09 and NomCaml B3, clear step-like lines con- 
sisting of red cells appeared with a constant interval 
throughout the sequence. The interval was 31 basic 
repeat units for HooCam09 and 9 basic repeat units 
for NomCaml B3. HylCaml 5 showed step-like lines 
exhibiting two types of intervals (8-unit interval for 
units 5-12 and 1 3-unit interval for units 13-44). 
Thus, it is evident that the AS sequences contained in 
these clones are organized into HOR structures. 
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Figure 4. Pairwise comparisons of basic repeat unit sequences. The comparisons were made using the MEGA5 program'' under default 
settings. The horizontal and vertical axes of each matrix represent consecutive basic repeat units contained in the AS contig sequence. 
Cells showing nucleotide identities of 90-95 and > 95% are indicated by yellow and red, respectively. The same matrices containing the 
identity values in the cells are shown in Supplementary Figs SI -S3. 



From the matrices shown in Fig. 4, we picked up cells 
that represented pairs of basic units located at identical 
positions in different HOR units (e.g. the second basic 
repeat unit in the third HOR unit and the second basic 
repeat unit in the fourth HOR unit). In the HooCam09 
clone, the average pairwise nucleotide identity among 
these pairs was 99.8%. The average value among the 
pairs of the other relationships was 89.2%. In the 
NomCamI B3 clone, the values were 98.6 and 88.0%, 
respectively. In the analysis of the HylCamI 5 clone, in 
which the 1 3th-44th basic repeat units (exhibiting a 
1 3-unit interval) were used, we obtained the values of 
97.0 and 86.5%, respectively. These situations are 
similar to those observed in human AS sequences: 
95-99 and 70-90%, respectively.^^ 

3.7. Chromosomal locations 

We conducted FISH analysis of chromosomes, by using 
HooCam09,HylCam1 5, and NomCamI B3 as probes, to 
identify chromosomal locations (Fig. 5). HooCam09 
exhibited strong signals on the centromere region of 
chromosome 4 of /-/oo/oc/^; /joo/oc/<:chromosome spreads. 
HylCamI 5 produced signals in the centromere regions 
of all Hylobates agilis chromosomes. NomCamI B3 



showed clear signa Is on the centromere regions and ter- 
minal regions of most chromosomes of Nomascus siki. 
These results are consistent with the FISH results we 
obtained previously by using a clone of siamang AS as a 
probe.^° 

3.8. Comparison of AS sequences among hominoid 
species 

We made within-species consensus sequences of 
AS repeat units of Hoolock hoolock, Hylobates agilis, and 
Nomascus siki. Furthermore, from references, we ob- 
tained consensus sequence for Nomascus leucogenys, S. 
syndactylus, human, and orangutan. We then made a 
neighbour-joining phylogenetic tree of these consensus 
sequences, which is shown in Fig. 6. The five gibbon 
species representingfourgenera formed aclusterdistinct 
from that of the two hominid species, and the two 
Nomascus species formed an internal cluster within the 
gibbon cluster. 

4. Discussion 

In addition to humans, higher-order AS has also been 
observed in all great ape species examined to date. 
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Chromosome: Hoolock hoolock 
Probe: HooCam09 



Chromosome: Hylobates agilis 
Probe: HylCam15 




Chromosome: Nomascus leucogenys 
Probe: NotnCam1B3 




Figure 5. FISH analysis of chromosomes for locations of AS. The left panels show FISH images obtained using a fluorescence scanner, and the 
right panels are images of the same chromosome spreads stained with 4',6-diamidino-2-phenylindole (DAPI). In the left panels, red signals 
indicate hybridization of the probe DNA to chromosome DNA. The combination of chromosome spread and probe is shown on each left 
panel. The bar in the panels represents 1 0 |j,m. All experimental procedures were the same as those described in our previous work.'" 

although it had not been found in small apes (except for We have demonstrated the presence of high-order AS 

our recent report) or other primate taxa more distant in all of the four genera of small apes. Thus, higher-order 

from humans. Based on this distribution in primates, AS occurs widely in hominoids. The species of the 

it is widely postulated that higher-order AS arose in Symphalangus genus that we previously examined was 

the common ancestor of hominids. shown to have higher-order AS containing four or six 
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Figure 6. Neighbour-joining phylogenetic tree of within-species consensus sequences. The consensus sequences of AS of Hoolock hoolock, 
Hylobates agilis and Nomascus siki were obtained by sending the contig sequences, which we determined in the present study, to the 
Consensus Maker program (version 2.0.0; http://www.hiv.lanl.gov/content/sequence/CONSENSUS/consensus.html). The consensus 
sequences obtained are included in the respective DDBJ files. The consensus sequences for Nomascus leucogenys,^^ Symphalangus 
syndactylus,'^ human^^, and orangutan'' AS were obtained from the respective references. The neighbour-joining phylogenetic tree of 
these within-species consensus sequences was made using the ClustalW program contained in MEGA.^' 



basic repeat units in its HOR units. In the present study, 
we found clear HOR structures in the AS of species of 
Hoolock and Nomascus, in which 31 and 9 basic repeat 
units formed HOR units, respectively. HOR structures 
were also found in a species of Hylobates, although two 
types of HORs (8 and 1 3 basic repeat units) coexisted 
in a single clone. 

Nomascus leucogenys (Northern white-cheeked 
gibbon) has been extensively examined in the past for 
HOR structures by computational analysis of its 
genome sequence database and additional FISH experi- 
ments, but no positive evidence was obtained. In the 
present study, we have identified HOR in AS of N. siki 
(Southern white-cheeked gibbon). These two species 
are so close phylogenetically that they have been 
regarded as subspecies of one species until about a 
decade ago (N. leucogenys leucogenys and N. leucogenys 
siki, respectively). The most probable reason for this dif- 
ference of results would be the difference in the method 
used; we relied on traditional sequencing of long clones 
preceded by detection of HOR signs. Another possible 
factor is the difference in the gibbon species. Satellite 
DNA is known to undergo rapid fluctuation in copy 
number. The HOR-containing AS we found may have 
been amplified in a small number of generations in the 
lineage leading to N. siki (or the individual we used), or 
its copy number may have been reduced rapidly in the 
lineage for N. leucogenys (or the individual used by 
those authors). 

Our strategy i ncl uded a scree n i n g ste p of AS-co nta i n - 
ing fosmid clones for those exhibiting HOR signs, in 
which small parts of candidate clones were sequenced. 
We examined four Hoolock and six Nomascus clones by 
the method newly developed in the present study, 
and one and three clones, respectively, exhibited an 
HOR sign. Considering that our library screening did 
not involve a factor to favour HOR-containing AS 
clones, the HOR structure does not appear to be rare 
in AS sequences contained in their genomes. The lack 
of a sign does not necessarily mean that the AS clones 



examined have no HOR structure. In our method, as 
we described in the Results section, the probability of 
finding a sign in a single fosmid clone is 1 5 /n, where n 
is the number of basic repeat units constituting the 
HOR unit. If n is as large as 1 00, it would be required 
to increase the numberof fosmid clonesto be examined 
orthe numberofTnS insertions in individual clones. We 
did not attempt this in the present study because we 
wanted to develop another, more efficient method. 

Another significance of our results is that we found 
HOR structures in AS residing in the centromere. This 
question remained from our previous study,^ because 
Symphalangussyndactyluscames AS\n terminal regions 
of chromosomes in addition to in centromeres. The 
species of genus Nomc/scus examined so far also contain 
AS in terminal regions.^°'^^ In contrast, such structures 
are not found in Hoolock hoolock and Hylobates agilis,^° 
and, therefore, the possibility that the fosmid clones we 
sequenced originate from chromosomal portions other 
than the centromere would be negligible. 

The currently dominant view about the evolutionary 
origin of HOR-containing AS is that it arose in the 
common ancestor of hominids (family Hominidae). 
The evidence of its widespread occurrence in the 
other family (Hylobatidae) of hominoids requires re- 
consideration of the currently dominant view. Likely 
explanations for the history of HOR-containing AS are 
as follows: (i) HOR-containing AS was present in the 
last common ancestor of hominoids and (ii) HOR-con- 
taining AS emerged in most or all basal branches of 
hominoids. As a more parsimonious explanation, (i) 
would be preferred. 

The neighbour-joining phylogenetic tree of AS 
seq uences is consistent with the widely accepted phylo- 
genetic relationshipof the host species. This neighbour- 
joining phylogenetic tree is, however, not powerful 
enough to extract information concerning the history 
of HOR structures, because the source sequences of 
the gibbon species (except for the case of Nomascus 
leucogenys) are limited to higher-order AS, while the 
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consensus sequences of the hominid species represent 
both higher-order AS and monomeric AS. 

The purpose of the present study was to show evi- 
dence, if any, for HOR-containing AS in multiple 
gibbon genera, which was attained with lines clearly 
appearing in constant intervals in the pairwise com- 
parison tables. It is now of interest whether AS 
sequences of gibbons exhibit within-species variation 
in the HOR structure, and, if so, how large the variation 
is. In hominids, the within-species HOR structure vari- 
ation is larger in humans than in great ape species, 
and the reason for the higher level of variation in 
humans is not known. Furtherstudies by identifying dif- 
ferent types of HOR structures in gibbons are expected 
to contribute to the understanding of factors leading 
to the high level of variation of AS structure in 
humans, and factors that are associated with the more 
important role of higher-order AS than monomeric AS 
in centromere function. 
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